Empirical central limit theorem for
cluster functionals without mixing

Paul Doukhan and Jose-Gregorio Gomez


Abstract 

We prove central limit theorems (CLT) for empirical processes of extreme values cluster
functionals as in Drees and Rootzen (2010). We use coupling properties established for
Dedecker & Prieur's τ-dependence coefficients in order to weaken the dependence
conditions while still obtaining these CLT. The assumptions are precisely set for particular
processes and cluster functionals of interest. The number of excesses provides a complete
example of a cluster functional for a simple non-mixing model (AR(1)-process) for which our
results are definitely needed. We also give an explicit expression for the covariance structure
of the limit Gaussian process.

We also include in this paper some results of Drees (2011) for the extremal index, together
with some simulations for this index to demonstrate the accuracy of this technique.

Keywords and phrases: Extremes, clustering of extremes, cluster functional of
extremes, extremal index, uniform central limit theorem, τ-weak dependence, tail
empirical process.


1. Introduction 


Drees & Rootzen [2010] prove limit theorems for empirical processes of cluster
functionals (EPCF). In statistics, Gomez [2015] proves a CLT in the sense of convergence
of finite-dimensional distributions (fidis) under weaker conditions, and considers an
example for which a functional CLT is not necessary.

We extend the result under τ-weak dependence. The classical example of a non-mixing
autoregressive model demonstrates the importance of this functional extension. Moreover,
the estimation of the extremal index provides us with a suitable example of application of
the functional CLT.

For a real-valued random process $(X_i)_{i \in \mathbb{N}}$, a typical example of an EPCF is the tail
empirical process:

\[ T_n(x) = \frac{1}{\sqrt{n v_n}} \sum_{i=1}^{n} \left( \mathbf{1}\{X_i > a_n x + u_n\} - P(X_i > a_n x + u_n) \right), \quad x \ge 0, \tag{1} \]

where $v_n = P\{X_{n,1} \neq 0\}$ is decreasing to zero, $(u_n)_{n \in \mathbb{N}}$ is a non-decreasing sequence of
thresholds and $(a_n)_{n \in \mathbb{N}}$ is a sequence of positive constants. This process has been
considered by Drees and Rootzen under suitable dependence conditions (in particular under
α- and β-mixing conditions), where they prove its uniform convergence to a Gaussian
process T under additional conditions. For example, they prove the convergence of this
tail empirical process for k-dependent sequences or stable AR(1)-processes
[Rootzen, 1995], ARCH(1)-processes [Drees, 2002, 2003] and some applications for
solutions of stochastic difference equations [Drees, 2000, 2002, 2003]. Finally, they use the
cluster functionals setting in [Yun, 2000] and [Segers, 2003] to generalize such empirical
processes under β-mixing in [Drees & Rootzen, 2010].

Unfortunately, the family of mixing processes is very restrictive. This can be seen with
the following AR(1)-process, solution of the recursion:

\[ X_k = \frac{1}{b}\left(X_{k-1} + \xi_k\right), \quad k \in \mathbb{Z}, \tag{2} \]

where $b \ge 2$ is an integer and $(\xi_k)_{k \in \mathbb{N}}$ are independent and uniformly distributed random
variables on the set $U(b) := \{0, 1, \dots, b-1\}$. This process is not even α-mixing, as shown
in [Andrews, 1984] for b = 2 and in [Ango Nze & Doukhan, 2004] for b > 2. Thus, the
results in [Drees & Rootzen, 2010] cannot be used here. However, such a process (2) is
τ-weakly dependent, as shown in [Dedecker & Prieur, 2004a]. The same situation occurs
more generally for causal Bernoulli shifts, Markovian models, etc. It is thus useful to
improve on the CLT for empirical processes of extreme cluster functionals proposed by
Drees & Rootzen [2010], extending it to more general classes of weakly dependent processes.
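To make the AR(1) recursion above concrete, here is a minimal simulation sketch in Python. The function name `simulate_ar1`, the burn-in length and the seed are illustrative assumptions and do not come from the paper. Since the stationary marginal is uniform on [0,1], the path is started from a uniform draw, so the burn-in is only a precaution.

```python
import random


def simulate_ar1(n, b=2, burn_in=100, seed=0):
    """Simulate n values of X_k = (X_{k-1} + xi_k) / b.

    xi_k is drawn uniformly from {0, 1, ..., b-1}; the start value is
    uniform on [0, 1), which is the stationary marginal of the chain.
    (Hypothetical helper, written for illustration only.)
    """
    rng = random.Random(seed)
    x = rng.random()              # X_0 ~ U[0, 1)
    path = []
    for k in range(burn_in + n):
        xi = rng.randrange(b)     # innovation in U(b) = {0, ..., b-1}
        x = (x + xi) / b          # one step of the recursion
        if k >= burn_in:
            path.append(x)
    return path


if __name__ == "__main__":
    sample = simulate_ar1(10, b=2, seed=42)
    print([round(v, 4) for v in sample])
```

Every simulated value stays in [0,1), because $(x + \xi)/b < 1$ whenever $x < 1$ and $\xi \le b - 1$.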


In order to do this, we use the coupling results of Dedecker & Prieur [2004a, 2005]
under τ-dependence assumptions, so that Van Der Vaart & Wellner [1996]'s results on
tightness and asymptotic equicontinuity (under independence) can be used together with
the fidis convergence of the EPCF.

This paper is organized as follows. In Section 2, we recall basic definitions and
notations: cluster functionals, the triangular arrays (or normalized excesses of random
variables) and examples. Then, we give the definition of empirical processes of cluster
functionals and close the section with a simple version of the CLT for those empirical
processes. In Section 3 we begin with the fidis convergence of the EPCF, followed by
the conditions to obtain asymptotic tightness and asymptotic equicontinuity of those
processes, which yield uniform convergence. We close this section with an application: the
blocks estimator of the extremal index. In Section 4 we develop an example similar to (1) in
the multidimensional case for the AR(1)-inputs (2), together with a simulation study for the
extremal index to demonstrate the accuracy of this technique. The τ-weak dependence,
with some examples, and the proofs are deferred to the Appendix.

2. Basic definitions and notations 


To define the empirical processes of cluster functionals it is necessary to consider two 
important ingredients: the cluster functionals and the excesses over high thresholds. 
2.1. Cluster Functionals


Let $(E, \mathcal{E})$ be a measurable subspace of $(\mathbb{R}^d, \mathcal{B}(\mathbb{R}^d))$ for some $d \ge 1$ such that $0 \in E$.
Following the deterministic definition of Drees & Rootzen [2010]¹, we consider the set of
E-valued sequences of finite length, i.e.,

\[ E_\cup := \{(x_1, \dots, x_r) : x_i \in E, \ \forall i = 1, \dots, r; \ \forall r \in \mathbb{N}\}, \]

equipped with the σ-field $\mathcal{E}_\cup$ induced by the Borel σ-fields on $E^r$, for $r \in \mathbb{N}$. Let $x \in E_\cup$; then
we can write $x = (x_1, \dots, x_r)$ for some $r \in \mathbb{N}$. The core² $x^c \in E_\cup$ of $x$ is defined by

\[ x^c := \begin{cases} (x_{r_I}, x_{r_I+1}, \dots, x_{r_S}), & \text{if } x \neq 0_r \text{ (the null element in } E^r\text{)}, \\ 0, & \text{otherwise,} \end{cases} \]

where $r_I := \min\{i \in \{1, \dots, r\} : x_i \neq 0\}$ (first non-null value of the block x) and
$r_S := \max\{i \in \{1, \dots, r\} : x_i \neq 0\}$ (last non-null value of the block x). A cluster
functional is a measurable map $f : (E_\cup, \mathcal{E}_\cup) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ such that

\[ f(x) = f(x^c) \ \text{for all } x \in E_\cup, \quad \text{and} \quad f(0_r) = 0 \ (\forall r \ge 1). \tag{3} \]

Under the properties (3), it is easy to build a large number of examples of cluster
functionals. Nevertheless, the typical examples used to build estimators through these
cluster functionals are functionals of the type:

\[ f(x_1, \dots, x_r) = \sum_{i=1}^{r} \varphi(x_i), \tag{4} \]

where $\varphi : E \to \mathbb{R}$ is a measurable function such that $\varphi(0) = 0$. Generally speaking,
these functions $\varphi$ are indicator functions (or products of another
measurable function $H : E \to \mathbb{R}$ with an indicator function). Another classic example is
the component-wise maximum of a cluster:

\[ f(x_1, \dots, x_r) = \max_{1 \le i \le r} x_i, \tag{5} \]

for $E = [0, \infty)$.
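The definitions above can be sketched in a few lines of Python for the real-valued case. The names `core`, `sum_functional` and `max_functional` are hypothetical, and representing the core of a null block by the empty tuple is a convention chosen here for convenience. The sketch computes the core of a block and checks the defining property $f(x) = f(x^c)$ for functionals of the sum type and of the maximum type.

```python
def core(x):
    """Return the core of a block: the smallest sub-block containing all
    non-null values (and the null values lying between them).

    A block without non-null values is mapped to the empty tuple, which
    plays the role of the null element in this sketch.
    """
    nonzero = [i for i, v in enumerate(x) if v != 0]
    if not nonzero:
        return ()
    return tuple(x[nonzero[0]:nonzero[-1] + 1])


def sum_functional(phi):
    """Build f(x_1, ..., x_r) = sum_i phi(x_i), where phi(0) must be 0."""
    return lambda x: sum(phi(v) for v in x)


def max_functional(x):
    """Maximum of a block with values in [0, inf); 0 for a null block."""
    return max(x, default=0)


if __name__ == "__main__":
    x = (0, 1, 2, 4, 0, 3, 0, 1, 0, 0)
    count_above_2 = sum_functional(lambda v: 1 if v > 2 else 0)
    print(core(x))
    print(count_above_2(x), count_above_2(core(x)))
    print(max_functional(x), max_functional(core(x)))
```

Because leading and trailing zeros contribute nothing to either functional, evaluating on the block or on its core gives the same value.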

For the set $E = [0, \infty)$, two particular examples that we shall only mention here,
with a view to motivating further work, are the following functionals:

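• Balanced periods at u > 0,

\[ f(x_1, \dots, x_r) = \begin{cases} \mathbf{1}\left\{ \sum_{i=1}^{r} (x_i - u)\, \mathbf{1}\{x_i > 0\} = 0 \right\}, & \text{if } x_i > 0 \text{ for some } i = 1, \dots, r, \\ 0, & \text{if } x_i = 0 \text{ for all } i = 1, \dots, r. \end{cases} \]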
¹ This definition is given by Yun and Segers [2003], for the real case.

² Note that the core also considers the null values that exist between the non-null values. For example,
$(0,1,2,4,0,3,0,1,0,0)^c = (1,2,4,0,3,0,1)$, which is the smallest sub-block of $x = (0,1,2,4,0,3,0,1,0,0)$ which
contains all non-null values as well as the null values between them.
• Maximum sum (greater than the level u > 0) of consecutive excesses,


\[ f(x_1, \dots, x_r) = \max_{1 \le p < q \le r} \sum_{y \in H_{p,q}} y, \]
with the notation: $H_{p,q} = \{x_p, x_{p+1}, \dots, x_q\} \subset \{x_1, \dots, x_r\}$ such that $y > 0$, $\forall y \in H_{p,q}$,
where $p, q \in \{1, 2, \dots, r\}$.


2.2. The excesses and their normalizations 

Without going into formalities yet, let us first consider the following examples, which
motivate the use of the triangular arrays $(X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$ throughout this work.

1.- Let $(X_i)_{i \in \mathbb{N}}$ be a real-valued stationary random process with marginal cumulative
distribution function F and let $(u_n)_{n \in \mathbb{N}}$ be a non-decreasing sequence of thresholds such
that $u_n \uparrow x_F$, where

\[ x_F = \sup\{x \in \mathbb{R} : F(x) < 1\}, \qquad v_n = P\{X_1 > u_n\} \xrightarrow[n \to \infty]{} 0. \]

If we want to study the process $X'_{n,i} = X_i - u_n \mid X_i > u_n$, first observe that the tail
distribution function of $X'_{n,i}$ may be asymptotically degenerate as $n \to \infty$, which means
that there exists a point $a \in \mathbb{R}$ such that

\[ P_n(x) = P\{X_1 - u_n > x \mid X_1 > u_n\} \xrightarrow[n \to \infty]{} \mathbf{1}\{x \le a\}. \]

However, if F belongs to the domain of attraction of some extreme-value distribution, then
by a result in [Pickands 1975], there exist $\gamma \in \mathbb{R}$ and a sequence of positive constants
$(a_n)_{n \in \mathbb{N}}$ (depending on the sequence $u_n$) such that

\[ P_n(x) = P\{X_{n,1} > x \mid X_1 > u_n\} \xrightarrow[n \to \infty]{} \begin{cases} (1 + \gamma x)_+^{-1/\gamma}, & \text{if } \gamma \neq 0, \\ e^{-x}, & \text{if } \gamma = 0, \end{cases} \]

locally uniformly in $(0, \infty)$, where

\[ X_{n,i} = \left( \frac{X_i - u_n}{a_n} \right)_+ := \max\left\{ \frac{X_i - u_n}{a_n}, 0 \right\}, \quad \text{for } 1 \le i \le n, \tag{6} \]

are the normalized excesses of $X_i$ over $u_n$.
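As a small illustration of the normalization by positive part just described, the following Python sketch maps raw observations to normalized excesses. The helper name `normalized_excesses` and the numerical values are assumptions made for the example only.

```python
def normalized_excesses(xs, u_n, a_n):
    """Map each observation x to max((x - u_n) / a_n, 0).

    Values below the threshold u_n (the "non-extreme" ones) are sent to 0;
    a_n > 0 is the scaling constant.
    """
    if a_n <= 0:
        raise ValueError("a_n must be positive")
    return [max((x - u_n) / a_n, 0.0) for x in xs]


if __name__ == "__main__":
    # Uniform-type example: threshold 0.9 and scale a_n = 1 - u_n = 0.1,
    # so that excesses are rescaled to the unit interval.
    print(normalized_excesses([0.5, 0.95, 0.99], u_n=0.9, a_n=0.1))
```

With uniform marginals and $a_n = 1 - u_n$, the non-null normalized excesses lie in (0,1], which is the setting used later for the extremal index.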

2.- For the multidimensional case we may consider the following example. For $d \ge 1$,
let $(X_i)_{i \in \mathbb{N}}$ be an $\mathbb{R}^d$-valued random process such that $X_i = (X_i^{(1)}, X_i^{(2)}, \dots, X_i^{(d)})$ admits
coordinates with the same marginal distribution. In this case, a standardization of $X_i$
is:

\[ X_{n,i} = \left( \left( \frac{X_i^{(1)} - u_n}{a_n} \right)_+, \dots, \left( \frac{X_i^{(d)} - u_n}{a_n} \right)_+ \right), \tag{7} \]

where $(u_n)_{n \in \mathbb{N}}$ and $(a_n)_{n \in \mathbb{N}}$ are defined as in eqn. (6). Here, $X_{n,i}$ is the vector of
normalized excesses over the threshold $u_n$ for each coordinate.

Another interesting example is the normalization of d consecutive excesses of
real-valued random variables $(X_i)_{i \in \mathbb{N}}$, i.e.

\[ X_{n,i} = \left( \left( \frac{X_i - u_n}{a_n} \right)_+, \dots, \left( \frac{X_{i+d-1} - u_n}{a_n} \right)_+ \right). \tag{8} \]

This example is given in Section 3 of [Drees & Rootzen, 2010]. Observe that this example
is a particular case of the example of eqn. (7), which corresponds to $X_i^{(j)} = X_{i+j-1}$ for
$1 \le j \le d$.

Notice that this example brings information on the extremal dependence structure. Some
applications of this standardization could be:

(i) d consecutive days of rain are observed in a given city, such that the volume of precipitation
may be larger than the volume of water that can be drained (through sewers, soil, rivers, etc.),

(ii) d very large claims are reported to an insurance company in a very short time interval (with
respect to typical cases), which can be a risk for the response capacity of the
insurance company, and

(iii) d consecutive days of low temperatures are observed in a given city, such that the power
consumption (due to heating, etc.) endangers the response capacity of the company in charge of
the energy distribution.

In general, let $(E, \mathcal{E})$ be a measurable subspace of $(\mathbb{R}^d, \mathcal{B}(\mathbb{R}^d))$ for some $d \ge 1$
such that $0 \in E$. We denote by $(X_{n,i})_{1 \le i \le n}$ the E-valued row-wise stationary
standardized random variables, defined on some probability space $(\Omega, \mathcal{A}, P)$, which are built from
a stationary random process $(X_i)_{i \in \mathbb{Z}}$ in such a way that the standardization $X_{n,i}$ maps
all "non-extreme" values to zero. Additionally, the sequence of
conditional distributions of $X_{n,1}$ given that $X_{n,1}$ belongs to the failure set $S \subset E \setminus \{0\}$ (i.e.
$P_n(\cdot \mid S) := P\{X_{n,1} \in \cdot \mid X_{n,1} \in S\}$) should converge weakly to some non-degenerate limit.


2.3. Empirical Processes of Cluster Functionals 

Now, we want to apply cluster functionals to blocks of E-valued excesses
over a given sequence of thresholds and to define the empirical process indexed by
these functionals.

In order to do that, first let us consider a row-wise stationary E-valued triangular
array $(X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$, defined on some probability space $(\Omega, \mathcal{A}, P)$.

Let $Y_{n,j}$ be the j-th block of $r_n$ consecutive values of the n-th row of $(X_{n,i})$. That is, we
have $m_n := [n/r_n] = \max\{j \in \mathbb{N} : j \le n/r_n\}$ blocks

\[ Y_{n,j} := (X_{n,i})_{(j-1) r_n + 1 \le i \le j r_n} \tag{9} \]

of length $r_n$, with $1 \le j \le m_n$. In order to simplify the notation, since $(X_{n,i})_{1 \le i \le n}$ is
stationary for each n, we denote by $Y_n$ the "generic block" such that $Y_n \overset{d}{=} Y_{n,1}$.
Let $\mathcal{F}$ be a class of cluster functionals. The "empirical process $Z_n$ of cluster functionals"
in $\mathcal{F}$ is the process $(Z_n(f))_{f \in \mathcal{F}}$ defined by

\[ Z_n(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{m_n} \left( f(Y_{n,j}) - E f(Y_{n,j}) \right), \tag{10} \]

where $v_n := P\{X_{n,1} \in S\}$ and S is the failure set.
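The block construction and the normalized sum defining $Z_n(f)$ can be sketched as follows. This is only an illustration under stated assumptions: the names `blocks` and `epcf` are hypothetical, and the expectation $E f(Y_{n,1})$ is passed in by the caller as `mean_f`, since in practice it is unknown and has to be computed or approximated separately.

```python
import math


def blocks(row, r_n):
    """Split one row of the triangular array into m_n = floor(n / r_n)
    consecutive blocks of length r_n (a possible remainder is dropped)."""
    m_n = len(row) // r_n
    return [tuple(row[j * r_n:(j + 1) * r_n]) for j in range(m_n)]


def epcf(row, r_n, v_n, f, mean_f):
    """Evaluate Z_n(f) = (n v_n)^(-1/2) * sum_j (f(Y_{n,j}) - mean_f).

    mean_f stands for E f(Y_{n,1}); it is an input of this sketch.
    """
    n = len(row)
    centred_sum = sum(f(y) - mean_f for y in blocks(row, r_n))
    return centred_sum / math.sqrt(n * v_n)


if __name__ == "__main__":
    row = [0, 1, 0, 0, 2, 0, 0, 0, 0]   # toy row with n = 9
    print(blocks(row, 3))
    print(epcf(row, r_n=3, v_n=4 / 9, f=sum, mean_f=0.5))
```

In the toy call, the three block sums are 1, 2 and 0, the centred sum is 1.5 and $\sqrt{n v_n} = 2$.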

In order to approach the convergence in fidis of the EPCF (10), observe that
if the blocks are independent and if we take into account the following
essential convergence assumptions:

(C.1) $E\left[ (f(Y_n) - E f(Y_n))^2\, \mathbf{1}\{|f(Y_n) - E f(Y_n)| > \epsilon \sqrt{n v_n}\} \right] = o(r_n v_n)$,
for all $\epsilon > 0$, and for all $f \in \mathcal{F}$,

(C.2) $(r_n v_n)^{-1} \mathrm{Cov}(f(Y_n), g(Y_n)) \to c(f, g)$, for all $f, g \in \mathcal{F}$,

with $r_n \ll v_n^{-1} \ll n$, then the fidis of the empirical process $(Z_n(f))_{f \in \mathcal{F}}$ of cluster
functionals converge to the fidis of a Gaussian process $(Z(f))_{f \in \mathcal{F}}$ with covariance function
c.


Drees & Rootzen [2010] have proved CLTs for the process (10). In particular, they have
proved CLTs in fidis for this process by using the Bernstein blocks technique together with
a β-mixing coupling condition to reduce the problem to the convergence of sums over
i.i.d. blocks through Eberlein [1984]'s technique involving the total variation metric.
Moreover, Drees & Rootzen [2010] extend the results to uniform convergence by
adding Van Der Vaart & Wellner [1996]'s tightness criteria and asymptotic equicontinuity
conditions to the fidis results that they had obtained.

We aim at extending their CLTs for the empirical process $(Z_n(f))_{f \in \mathcal{F}}$, since the family
of mixing processes is still very restrictive. One particular and really simple example of
a non-mixing process is the AR(1)-process (2). We derive some results as in [Drees &
Rootzen, 2010] and some applications as in [Drees, 2011] under much weaker dependence
conditions, which include e.g. this example.

The τ-weak dependence introduced by Dedecker & Prieur [2004a] holds for the example
in eqn. (2), as well as more generally for Bernoulli shift processes and Markov chains.


3. Central limit theorems for cluster functionals 
3.1. Fidis convergence 

First we give the convergence in fidis. In this case, the technique used to prove the
convergence of the empirical process (10) is also the Bernstein blocks technique. In order
to do this, we need to extract from each block $Y_{n,j}$ of length $r_n$ a sub-block of length $l_n$,
in such a way that $l_n = o(r_n)$. Then we use the remaining sub-blocks, separated by $l_n$
variables, combined with convenient conditions of τ-dependence, to couple independent
blocks to the original (dependent) blocks, and thus obtain the CLTs through
classic tools.

For this, it is necessary for the triangular array $(X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$ to satisfy the relation:
(D.1) $\tau_{1,n}(l_n) = o(r_n^{-1})$,

such that 

(B.1) $l_n \ll r_n \ll v_n^{-1} \ll n$, where $l_n \to \infty$ and $n v_n \to \infty$ as $n \to \infty$, and

(B.2) $E\left( \|X_{n,1}\| \mid X_{n,1} \neq 0 \right) < \infty$.

On the other hand, we must not forget the influence on the empirical process (10) of the
small blocks of length $l_n$ that are extracted. In order to introduce assumptions
on these small blocks, it is necessary to consider the following notation, which we will
also use throughout the rest of this paper.

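Notation 1 Let $x = (x_1, x_2, \dots, x_r)$. We will use the notation $x^{[l:k]}$ as follows:

\[ x^{[l:k]} = \begin{cases} 0, & \text{if } r < l, \\ (x_l, \dots, x_k), & \text{if } l \le k \le r, \\ x^{[l:r]}, & \text{if } k > r, \end{cases} \]

and $x^{[k]} := x^{[1:k]}$. Moreover, if $f \in \mathcal{F}$ is a cluster functional, then we denote

\[ \Delta_n(f) := f(Y_n) - f(Y_n^{[r_n - l_n]}), \tag{11} \]

where $r_n$ is the length of the block $Y_n$ and $l_n$ is such that $l_n = o(r_n)$.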
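The quantity $\Delta_n(f)$ measures how much a functional changes when the last $l_n$ values of a block are removed. A minimal Python sketch, assuming that $x^{[k]}$ denotes the first k coordinates of the block (the names `truncate` and `delta` are hypothetical):

```python
def truncate(x, k):
    """First k coordinates of the block x (the whole block if k exceeds
    its length, the empty block if k <= 0)."""
    return tuple(x[:max(k, 0)])


def delta(f, block, l_n):
    """Difference f(Y) - f(Y truncated to its first r_n - l_n values),
    where r_n is the length of the block."""
    r_n = len(block)
    return f(block) - f(truncate(block, r_n - l_n))


if __name__ == "__main__":
    block = (0, 1, 0, 2)
    print(delta(sum, block, l_n=1))                        # removes the value 2
    print(delta(lambda y: max(y, default=0), block, l_n=1))
```

For functionals of the sum type, the difference is simply the contribution of the removed small block.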


The following assumption guarantees that the extraction of the small blocks does not
disturb the convergence in fidis (if it holds) of the EPCF $(Z_n(f))_{f \in \mathcal{F}}$.

(C.3) For all $f \in \mathcal{F}$,

\[ E\left[ |\Delta_n(f) - E\Delta_n(f)|^2\, \mathbf{1}\{|\Delta_n(f) - E\Delta_n(f)| \le \sqrt{n v_n}\} \right] = o(r_n v_n), \]

\[ P\{|\Delta_n(f) - E\Delta_n(f)| > \sqrt{n v_n}\} = o(r_n / n). \]


Theorem 1 Suppose that (B.1)-(B.2), (C.1)-(C.3) and (D.1) hold. Then the fidis of the cluster
functionals empirical process $(Z_n(f))_{f \in \mathcal{F}}$ converge to the fidis of a centered Gaussian process
$(Z(f))_{f \in \mathcal{F}}$ with covariance function c defined in assumption (C.2).

Remark 1 Note that Assumptions (C.1) and (C.3) are difficult to check in general. For
that reason, consider the following (more restrictive but easier to verify) alternative
conditions:

(A.1) $\mathrm{Var}(\Delta_n(f)) = o(r_n v_n)$ for all $f \in \mathcal{F}$.

(A.3) $E(f(Y_n))^{2+\delta} = O(r_n v_n)$ for some $\delta > 0$ and for all $f \in \mathcal{F}$.

Lemma 1 The conditions (A.1) and (A.3) imply the conditions (C.1) and (C.3), respectively.






Doukhan & Gomez • ECLT for cluster functionals without mixing • February 29, 2016 


For the proof of this lemma see Lemma 5.2 in [Drees & Rootzen, 2010].

Generally, the convergence in (C.2) can be easily verified. However, in some situations it
can be difficult to give the limit c explicitly. In this case, it is common to
use the "tail chain" associated with the original process. This terminology was used
by Perfekt [1994], and generalized by Yun [1998] to obtain explicit representations of the
extremal index of a higher-order stationary Markov chain. Segers [2003] generalizes this
result to stationary sequences under suitable conditions.

We will follow Segers [2003]'s rationale to give an explicit representation of the
covariance function c through the tail chains. The following proposition (which is similar
to Segers [2003]'s Theorems 1 and 3) provides some conditions which are sufficient to verify
(C.2) and which in some situations are easier to prove. The alternative expression for the
covariance function c (defined in (C.2)) is shown below in Corollary 1.

In order to carry this out, it is necessary to consider the following assumption:

(C.2') There is a sequence $W = (W_i)_{i \ge 1}$ of E-valued random variables such that, for all
$k \in \mathbb{N}$, the joint conditional distribution

\[ P_{(X_{n,i},\, \mathbf{1}\{X_{n,i} = 0\})_{1 \le i \le k} \mid X_{n,1} \neq 0} \]

converges weakly to $P_{(W_i,\, \mathbf{1}\{W_i = 0\})_{1 \le i \le k}}$, and all $f \in \mathcal{F}$ are a.s. continuous with respect
to the distribution of $W^{[k]} = (W_1, \dots, W_k)$ and $W^{[2:k]} = (W_2, \dots, W_k)$ for all k, that is,

\[ P\{W^{[2:k]} \in D_{f,k-1},\ W_i = 0,\ \forall i > k\} = P\{W^{[k]} \in D_{f,k},\ W_i = 0,\ \forall i > k\} = 0, \]

where we denote by $D_{f,k}$ the set of discontinuities of $f|_{E^k}$.

Remark 2 The existence of such a sequence W is guaranteed in particular by Theorem 2
in [Segers, 2003] with $E = \mathbb{R}$ and the normalization (6). There, Segers has shown that if,
for all $k \in \mathbb{N}$, the distribution function of $(X_1, \dots, X_k)$ belongs to the domain of attraction of
some k-dimensional extreme value distribution $G_k$, then there
exists such a "tail chain" $W = (W_i)_{i \in \mathbb{N}}$ such that

\[ P_{(X_{n,i},\, \mathbf{1}\{X_{n,i} = 0\})_{1 \le i \le k} \mid X_1 > u_n} \xrightarrow[n \to \infty]{w} P_{(W_i,\, \mathbf{1}\{W_i = 0\})_{1 \le i \le k}}, \tag{12} \]

for all $k \in \mathbb{N}$.

Proposition 1 Suppose that the r.v.'s $(X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$ satisfy the following condition:

(D.2) There exist $p, q \ge 1$ with $p^{-1} + q^{-1} = 1$ and $a > 0$, such that

limlimsup '\ p / r ‘^ ^ =0 and r n v]/ q —0 (13) 

l^oo „^ 00 l f l / P +a n->co 
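Then,

\[ E\left[ f(Y_n) \mid Y_n \neq 0 \right] = \theta_n^{-1} E\left[ f(Y_n) - f(Y_n^{[2:r_n]}) \mid X_{n,1} \neq 0 \right] + o(1), \]

where o(1) converges to 0 as $n \to \infty$ uniformly for all bounded cluster functionals $f \in \mathcal{F}$, and

\[ \theta_n := \frac{P\{Y_n \neq 0\}}{r_n v_n} = P\{Y_n^{[2:r_n]} = 0 \mid X_{n,1} \neq 0\}(1 + o(1)). \tag{14} \]

Additionally, if the assumption (C.2') is satisfied, then:

\[ m_W := \sup\{i \ge 1 : W_i \neq 0\} < \infty, \]

\[ \theta_n \xrightarrow[n \to \infty]{} \theta := P\{W_i = 0,\ \forall i \ge 2\} = P\{m_W = 1\} > 0, \]

\[ P_{f(Y_n) \mid Y_n \neq 0} \xrightarrow[n \to \infty]{w} \frac{1}{\theta} \left( P\{f(W) \in \cdot\} - P\{f(W^{[2:\infty]}) \in \cdot,\ m_W \ge 2\} \right). \]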

Corollary 1 Suppose that 

? = {/l(/(r„) ) „ e nj is uniformly integrable under P(-)/r n v n }. 

Assume that (B.1), (B.2), (C.2'), (C.3) and (D.2) hold. Then the fidis of the cluster functionals
empirical process $(Z_n(f))_{f \in \mathcal{F}}$ converge to the fidis of a centered Gaussian process $(Z(f))_{f \in \mathcal{F}}$
with covariance function c defined by

\[ c(f, g) = E\left[ (fg)(W) - (fg)(W^{[2:\infty]}) \right]. \tag{15} \]

There are many cases in which $\|f\|_\infty = \sup_{x \in E_\cup} |f(x)| < \infty$ for all $f \in \mathcal{F}$. Under this
condition, it is clear that the conditions (C.1) and (C.3) are satisfied. Therefore, it is
important to note the following corollary.

Corollary 2 Suppose that (B.1), (B.2), (C.2') and (D.2) are satisfied. Then, if $\|f\|_\infty = \sup_{x \in E_\cup} |f(x)| < \infty$
for all $f \in \mathcal{F}$, the fidis of the cluster functionals empirical process $(Z_n(f))_{f \in \mathcal{F}}$ converge to the
fidis of a centered Gaussian process $(Z(f))_{f \in \mathcal{F}}$ with covariance function c defined by (15).

3.2. Uniform convergence 

To prove uniform convergence, we use either asymptotic tightness of $Z_n$ in the space
$\ell^\infty(\mathcal{F})$, or asymptotic equicontinuity conditions, by some results in § 2.11 of Van Der
Vaart & Wellner [1996]. Those results require independence; therefore a coupling argument
for the blocks $(Y_{n,j})_{1 \le j \le m_n,\, n \in \mathbb{N}}$ is also used here.


3.2.1 Asymptotic tightness

Definition 1 The sequence $(Z_n)_{n \in \mathbb{N}}$ is asymptotically tight if for every $\epsilon > 0$ there exists a
compact set $K \subset \ell^\infty(\mathcal{F})$ such that

\[ \limsup_{n \to \infty} P^*(Z_n \notin K^\delta) < \epsilon, \quad \text{for every } \delta > 0, \]

where $K^\delta = \{f \in \ell^\infty(\mathcal{F}) : d_{\mathcal{F}}(f, K) < \delta\}$ is the "δ-enlargement" around K and $P^*$ denotes the
outer probability.
Definition 2 The bracketing number $N_{[\cdot]}(\epsilon, \mathcal{F}, L_2^n)$ is defined as the smallest number $N_\epsilon$ such
that for each $n \in \mathbb{N}$ there exists a partition $(\mathcal{F}^n_{\epsilon,k})_{1 \le k \le N_\epsilon}$ of $\mathcal{F}$ such that

\[ E^* \sup_{f, g \in \mathcal{F}^n_{\epsilon,k}} (f(Y_n) - g(Y_n))^2 \le \epsilon^2 r_n v_n, \quad \text{for } 1 \le k \le N_\epsilon, \]

where $E^*$ denotes the outer expectation.

In order to use Theorem 2.11.9 in [Van Der Vaart & Wellner, 1996] we need:

(T.1) The set $\mathcal{F}$ of cluster functionals is such that for each $f \in \mathcal{F}$ the expression $E f^2(Y_n)$
is finite for all $n \in \mathbb{N}$, and such that the envelope function satisfies:

\[ F(x) := \sup_{f \in \mathcal{F}} |f(x)| < \infty, \quad \forall x \in E_\cup. \]

(T.2) $E^*\left( F(Y_n)\, \mathbf{1}\{F(Y_n) > \epsilon \sqrt{n v_n}\} \right) = o(r_n \sqrt{v_n / n})$ for all $\epsilon > 0$.

Note that for a sequence of monotonically increasing positive functions $(h_n(\delta))_{n \ge 1}$ the
convergence of $h_n(\delta_n)$ to zero for all $\delta_n \downarrow 0$ is equivalent to

\[ \lim_{\delta \downarrow 0} \limsup_{n \to \infty} h_n(\delta) = 0; \]

thus Assumptions 2 and 3 of Theorem 2.11.9 from [Van Der Vaart & Wellner, 1996]
are reformulated as follows:

(T.3) There exists a semi-metric ρ on $\mathcal{F}$ such that $\mathcal{F}$ is totally bounded with respect to
(w.r.t.) ρ and

\[ \lim_{\delta \downarrow 0} \limsup_{n \to \infty} \sup_{f, g \in \mathcal{F} : \rho(f,g) < \delta} \frac{1}{r_n v_n} E\left( f(Y_n) - g(Y_n) \right)^2 = 0. \]

(T.4)

\[ \lim_{\delta \downarrow 0} \limsup_{n \to \infty} \int_0^\delta \sqrt{\log N_{[\cdot]}(\epsilon, \mathcal{F}, L_2^n)}\, d\epsilon = 0. \]

Theorem 2 Suppose that (B.1), (B.2), (D.1) hold and that (T.1)-(T.4) are satisfied. Then the
empirical process $(Z_n)_{n \in \mathbb{N}}$ is asymptotically tight in $\ell^\infty(\mathcal{F})$. Moreover, if the assumptions (C.1)-
(C.3) hold, then $Z_n$ converges to a centered Gaussian process Z with covariance function c in
(C.2).


3.2.2 Asymptotic equicontinuity 

Definition 3 The sequence $(Z_n)_{n \in \mathbb{N}}$ is asymptotically equicontinuous w.r.t. a semi-metric ρ if
for any $\epsilon > 0$ and $\eta > 0$ there exists some $\delta > 0$ such that:

\[ \limsup_{n \to \infty} P^*\left( \sup_{f, g \in \mathcal{F} : \rho(f,g) < \delta} |Z_n(f) - Z_n(g)| > \epsilon \right) < \eta. \]




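We use Theorem 2.11.1 in [Van Der Vaart & Wellner, 1996] to prove asymptotic
equicontinuity. In order to do this, we need to define a semi-metric $\rho_n$ on $\mathcal{F}$ as follows. Let
$(Y^*_{n,j})_{1 \le j \le m_n}$ be independent copies of $(Y_{n,j})_{1 \le j \le m_n}$.
We define $\rho_n$ as:

\[ \rho_n(f, g) := \sqrt{ \frac{1}{n v_n} \sum_{j=1}^{m_n} \left( f(Y^*_{n,j}) - g(Y^*_{n,j}) \right)^2 }. \tag{16} \]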

Moreover, we denote by $N(\epsilon, \mathcal{F}, \rho)$ the "covering number", i.e. the minimum number of balls
(w.r.t. the semi-metric ρ) with radius $\epsilon > 0$ necessary to cover $\mathcal{F}$. In this way, we can add
to the list of assumptions the following two:

(T.4') For k = 1, 2 the map

\[ (Y^*_{n,1}, \dots, Y^*_{n,\lfloor m_n/2 \rfloor}) \mapsto \sup_{f, g \in \mathcal{F} : \rho(f,g) < \delta} \sum_{j=1}^{\lfloor m_n/2 \rfloor} e_j \left( f(Y^*_{n,j}) - g(Y^*_{n,j}) \right)^k \]

is measurable for each $\delta > 0$, each vector $(e_1, \dots, e_{\lfloor m_n/2 \rfloor}) \in \{-1, 0, 1\}^{\lfloor m_n/2 \rfloor}$ and each
$n \in \mathbb{N}$.

(T.5)

\[ \lim_{\delta \downarrow 0} \limsup_{n \to \infty} P^*\left( \int_0^\delta \sqrt{\log N(\epsilon, \mathcal{F}, \rho_n)}\, d\epsilon > \eta \right) = 0, \quad \forall \eta > 0. \]

Theorem 3 Suppose that (B.1), (B.2), (D.1) hold and that (T.1)-(T.3), (T.4') and (T.5) are satisfied.
Then the empirical process $(Z_n)_{n \in \mathbb{N}}$ is asymptotically equicontinuous. Moreover, if the assumptions
(C.1)-(C.3) hold, then $Z_n$ converges to a centered Gaussian process Z with covariance function c
in (C.2).


3.3. Application: blocks estimator of the extremal index 

Let $(X_i)_{i \in \mathbb{N}}$ be a real stationary time series with distribution function F. Now consider
the index defined in (14) with the extreme normalization (6) and $u_n := F^{\leftarrow}(1 - v_n t)$, for
all $t \in [0, 1]$, i.e.

\[ \theta_{n,t} := \frac{P\{Y_n \neq 0\}}{r_n v_n t} = \frac{P\{\max_{1 \le i \le r_n} X_i > u_n\}}{r_n v_n t}, \quad \text{with } t \in (0, 1]. \tag{17} \]

Note that if $r_n$ satisfies condition (B.1) and if (D.2) holds, then by Proposition 1
there exists a number $\theta \in (0, 1]$ (the extremal index) such that

\[ \theta_{n,t} \xrightarrow[n \to \infty]{} \theta \quad \text{uniformly for } t \in (0, 1]. \tag{18} \]

Given the convergence (18), Drees [2011] suggested estimating θ by
replacing the unknown probability $P\{\max_{1 \le i \le r_n} X_i > u_n\}$ and the unknown expectation
$r_n v_n t = E\left[ \sum_{i=1}^{r_n} \mathbf{1}\{X_i > u_n\} \right]$ by empirical counterparts, which gives the following estimator of $\theta_{n,t}$:

\[ \hat\theta_{n,t} := \frac{ \sum_{j=1}^{m_n} \mathbf{1}\{\max_{(j-1) r_n < i \le j r_n} X_i > u_n\} }{ \sum_{j=1}^{m_n} \sum_{i=(j-1) r_n + 1}^{j r_n} \mathbf{1}\{X_i > u_n\} }, \tag{19} \]

where $m_n = [n/r_n]$ such that $1 \ll r_n \ll n$ but $n v_n \to \infty$. Such an estimator
(19) (called the blocks estimator of the extremal index) can be expressed in terms of two empirical
processes of cluster functionals $(Z_n(f_t), Z_n(g_t))_{0 \le t \le 1}$ defined in (10). For this, suppose
without loss of generality that the random variables $(X_i)_{1 \le i \le n}$ are uniformly distributed
on [0,1] (otherwise, just consider the transformation $U_i = F(X_i)$, $1 \le i \le n$, where F is
the distribution function of $X_1$, see Drees [2011]). Then, with the normalization (6) such
that $a_n = v_n = 1 - u_n$ and the blocks $(Y_{n,j})_{1 \le j \le m_n}$ defined in (9), we have that

\[ \hat\theta_{n,t} = \frac{ m_n^{-1} \sum_{j=1}^{m_n} f_t(Y_{n,j}) }{ m_n^{-1} \sum_{j=1}^{m_n} g_t(Y_{n,j}) } = \frac{ E f_t(Y_{n,1}) + (n v_n)^{1/2} m_n^{-1} Z_n(f_t) }{ E g_t(Y_{n,1}) + (n v_n)^{1/2} m_n^{-1} Z_n(g_t) }, \tag{20} \]


where

\[ f_t(x_1, \dots, x_r) := \mathbf{1}\left\{ \max_{1 \le i \le r} x_i > 1 - t \right\}, \qquad g_t(x_1, \dots, x_r) := \sum_{i=1}^{r} \mathbf{1}\{x_i > 1 - t\}. \tag{21} \]

For this particular case, we consider the following assumptions:

(C.2.1) $(r_n v_n)^{-1} \mathrm{Cov}(g_s(Y_n), g_t(Y_n)) \to c_g(s, t)$, for all $0 \le s, t \le 1$.

(C.2.2) $(r_n v_n)^{-1} \mathrm{Cov}(f_s(Y_n), g_t(Y_n)) \to c_{fg}(s, t)$, for all $0 \le s, t \le 1$.

(T) For some bounded function $h : (0, 1] \to \mathbb{R}$ such that $\lim_{t \downarrow 0} h(t) = 0$,

\[ (r_n v_n)^{-1} E\left( f_s(Y_{n,1}) - f_t(Y_{n,1}) \right)^2 \le h(t - s), \quad \forall\, 0 \le s < t \le 1, \tag{22} \]

for all n sufficiently large.


The following results are a slight variation of the first two results of Drees [2011], in the sense
that we replace the β-mixing condition with a τ-dependence condition.


Proposition 2

(3.1) Suppose that (B.1), (B.2) and (D.1) are satisfied. Then $(Z_n(f_t))_{0 \le t \le 1}$ converges weakly to
$Z_f := (\sqrt{\theta} B_t)_{0 \le t \le 1}$, where B denotes a standard Brownian motion.

(3.2) If additionally (C.2.1) and (T) are satisfied and $r_n = o(\sqrt{n v_n})$, then the sequence of
processes $(Z_n(g_t))_{0 \le t \le 1}$ converges weakly to a centered Gaussian process $(Z_g(t))_{0 \le t \le 1}$
with covariance function $c_g$.

(3.3) Under all the hypotheses of (3.1) and (3.2), if moreover (C.2.2) holds, then $(Z_n(f_t), Z_n(g_t))_{0 \le t \le 1}$
converges weakly to $(Z_f(t), Z_g(t))_{0 \le t \le 1}$ with

\[ \mathrm{Cov}(Z_f(s), Z_f(t)) = \theta (s \wedge t), \]
\[ \mathrm{Cov}(Z_g(s), Z_g(t)) = c_g(s, t), \]
\[ \mathrm{Cov}(Z_f(s), Z_g(t)) = c_{fg}(s, t), \quad 0 \le s, t \le 1. \tag{23} \]


Using the same argument as in Remark 2, we can find explicit expressions for the
covariance functions $c_g$ and $c_{fg}$ as functions of the "tail chain" of the process. That is, if for
every $k \in \mathbb{N}$ the distribution function of $(X_1, \dots, X_k)$ belongs to the domain of attraction
of an extreme-value distribution, then there exists a sequence $W = (W_i)_{i \in \mathbb{N}}$ such that (12)
holds. In such a case:

\[ c_g(s, t) = s \wedge t + \sum_{i=1}^{\infty} \left( P\{W_1 > 1 - s,\ W_{i+1} > 1 - t\} + P\{W_1 > 1 - t,\ W_{i+1} > 1 - s\} \right), \]


\[ c_{fg}(s, t) = \begin{cases} P\{W_1 > 1 - t,\ \max_{j \ge 1} W_j > 1 - s\} + \sum_{i=1}^{\infty} P\{W_1 > 1 - s,\ W_{i+1} > 1 - t,\ \max_{j \ge 2} W_j \le 1 - s\}, & s < t, \\ t, & s \ge t. \end{cases} \]


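Corollary 3 Under the assumptions of Proposition 2 - (3.3),

\[ \left( \sqrt{n v_n}\, t\, (\hat\theta_{n,t} - \theta_{n,t}) \right)_{0 < t \le 1} \xrightarrow[n \to \infty]{} Z := Z_f - \theta Z_g, \tag{24} \]

where Z is a Gaussian process such that $E Z(t) = 0$ and

\[ \mathrm{Cov}(Z(s), Z(t)) = \theta \left( s \wedge t - c_{fg}(s, t) - c_{fg}(t, s) + \theta\, c_g(s, t) \right). \tag{25} \]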
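The blocks estimator discussed in this section is a ratio of two counts: the number of blocks containing at least one exceedance of the threshold, and the total number of exceedances. A minimal Python sketch of this ratio follows; the function name `blocks_estimator` and the toy data are assumptions made for illustration, and no claim is made about how the authors implemented their simulations.

```python
def blocks_estimator(xs, r_n, u_n):
    """Blocks estimator of the extremal index.

    Numerator: number of blocks of length r_n whose maximum exceeds u_n.
    Denominator: total number of exceedances of u_n over those blocks.
    Observations beyond m_n * r_n (incomplete last block) are ignored.
    """
    m_n = len(xs) // r_n
    blocks_with_exceedance = 0
    exceedances = 0
    for j in range(m_n):
        block = xs[j * r_n:(j + 1) * r_n]
        count = sum(1 for x in block if x > u_n)
        exceedances += count
        if count > 0:
            blocks_with_exceedance += 1
    if exceedances == 0:
        raise ValueError("no exceedance of the threshold u_n")
    return blocks_with_exceedance / exceedances


if __name__ == "__main__":
    data = [0.1, 0.95, 0.97, 0.2, 0.3, 0.1, 0.99, 0.1]
    print(blocks_estimator(data, r_n=4, u_n=0.9))
```

In the toy call, both blocks contain an exceedance and there are three exceedances in total, so the ratio is 2/3. Values close to 1 indicate isolated exceedances, and smaller values indicate clustering.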


4. Examples and Simulations 

4.1. AR(1)-process with the functional "number of excesses over x"

We consider the AR(1)-process (2) where $b \ge 2$ is an integer and $(\xi_i)_{i \in \mathbb{N}}$ are i.i.d. and
uniformly distributed on the set $U(b) := \{0, 1, \dots, b-1\}$.

It is clear that $X_0$ is uniformly distributed on [0,1]. Moreover, we define the
normalized random variables $(X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$ as in eqn. ( ) with $a_n = v_n = 1 - u_n$. We set
$(x_1, \dots, x_d) \le (y_1, \dots, y_d)$ if and only if $x_i \le y_i$ for all $i = 1, \dots, d$, in case $x, y \in [0, 1]^d$.
Then


P{X, U > x|X, u / 0} 


j lf ...,j d eu(b) 


s =1 


—> max {b* d (l — xd}, 

n-*> i=l,...,d 


(26) 
where $\tilde v_n := P\{X_{n,1} \neq 0\} \sim v_n = P\{X_1 > u_n\} \xrightarrow[n \to \infty]{} 0$.

Consider the family $\mathcal{F}$ of cluster functionals:

\[ \mathcal{F} = \left\{ f_x,\ x \in [0, 1]^d \right\}, \quad \text{with } f_x(x_1, \dots, x_r) = \sum_{i=1}^{r} \mathbf{1}\{x_i > x\}. \tag{27} \]

For this case, we obtain the covariance function c of (C.2):

\[ c(x, y) = \min\left( \max_{k=1,\dots,d} \{b^k (1 - x_k)\},\ \max_{k=1,\dots,d} \{b^k (1 - y_k)\} \right) + \sum_{i=1}^{\infty} H_{b,i}(x, y) + \sum_{i=1}^{\infty} H_{b,i}(y, x), \tag{28} \]

where, for $i \ge d$,

\[ H_{b,i}(x, y) := \frac{1}{b^i} \min\left( \max_{k=1,\dots,d} \{b^k (1 - x_k)\},\ \max_{k=1,\dots,d} \{b^{k+i} (1 - y_k)\} \right), \tag{29} \]

and for $1 \le i < d$,

\[ H_{b,i}(x, y) := \frac{1}{b^i} \min\left( \max_{k=1,\dots,i} \{b^k (1 - x_k)\},\ \max_{k=i+1,\dots,d} \{b^k \min(1 - x_k, 1 - y_k)\},\ \max_{k=d-i,\dots,d} \{b^{k+i} (1 - y_k)\} \right). \]
Conditions (C.1), (C.3), (T.1)-(T.4) hold for uniformly distributed random variables and
for the same family $\mathcal{F}$, see pages 2177 and 2178 in Drees & Rootzen [2010]. Thus, under
assumption (B.1), setting $l_n$ and $r_n$ such that $b^{-l_n} = o(r_n^{-1})$ (see Appendix A.1 - Application
1), the empirical process $(Z_n(x))_{x \in [0,1]^d}$ defined as

\[ Z_n(x) := \frac{1}{\sqrt{n v_n}} \sum_{i=1}^{r_n m_n} \left( \mathbf{1}\{X_{n,i} > x\} - P\{X_{n,i} > x\} \right) \tag{30} \]

converges to a centered Gaussian process Z with covariance function (28).


4.2. Simulation study 

The experiment consists in estimating the extremal index θ through the blocks estimator of the
extremal index (20).

Let us consider the AR(1)-process (2). Here, as $X_0$ is uniformly distributed on [0,1]
and $X_i = X_0 / b^i + \sum_{s=1}^{i} \xi_s / b^{i-s+1}$ for all $i \ge 1$, we use this to obtain a theoretical expression
for the index (17) with $u_n = 1 - v_n t$ for $t \in (0, 1]$:

\[ \theta_{n,t} = \frac{1}{b^{r_n} r_n v_n t} \sum_{j_1, \dots, j_{r_n} \in U(b)} \left( \min\left\{ 1,\ \max_{1 \le i \le r_n} \left\{ 1 - b^i (1 - v_n t) + \sum_{s=1}^{i} b^{s-1} j_s \right\} \right\} \right)_+, \tag{31} \]
Figure 1: Extremal index for the AR(1)-input.

Left: $\theta_{n=10^4,t}$ is the blue curve, $\hat\theta_{n=10^4,t}$ is the black curve and the confidence intervals
$I_t = (CI_i(t), CI_s(t))$ are the red curves, for the AR(1)-input (2) with b = 2. Right: the same
situation but with b = 3.


which converges to some $\theta = \theta(b) \in (0, 1)$ if (B.1) is satisfied with $b^{-l_n} = o(r_n^{-1} v_n)$, for
some p > 0.

We take advantage of this theoretical expression (31) of the extremal index
$\theta_{n,t}$ to compare it with its estimates $\hat\theta_{n,t}$. For this, we simulate
AR(1)-processes (2) for b = 2, 3 and their respective blocks estimators (20) with the
normalization (6), taking $a_n = v_n = 1 - u_n$, in order to compare the estimated results
with the theoretical model (31).

Let us suppose we have data of size $M = 600000$ generated by an AR(1)-process (2) with $b = 2$ (respectively $b = 3$). Because the process is stationary, we can divide the data into $N = 60$ blocks of length $n = 10^4$. Moreover, we choose a threshold $u_n$ such that $v_n = n^{-1/2}$, and sub-blocks of length $r_n = [\log(n)]$. In Figure 1 we show the polygonal curve $(t, \theta_{n=10^4,t})_{t=0.1,0.2,\dots,1}$ (blue curve) approximating $(t, \theta_{n=10^4,t})_{0 < t \le 1}$, and the mean polygonal curve $(t, \hat\theta_{n=10^4,t})_{t=0.1,0.2,\dots,1}$ (black curve) approximating $(t, \hat\theta_{n=10^4,t})_{0 < t \le 1}$. Note that for $n = 10^4$, the symmetry of the confidence band $(CI_i(t), CI_s(t))_{0 < t \le 1}$ around $(\hat\theta_{n,t})_{0 < t \le 1}$, at confidence level $1 - \alpha = 0.95$ (red curves), already shows the Gaussian behaviour of the estimator. Furthermore, as expected, the value estimated through the blocks estimator is quite close to the theoretical extremal index (31) for $n = 10^4$. The numerical results are shown in Tables 1 and 2, for the cases $b = 2$ and $b = 3$, respectively.
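
The experiment described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the blocks estimator is written in the common form "number of sub-blocks containing an exceedance divided by the number of exceedances", which may differ in detail from (20); the function names and the seed are hypothetical; and the recursion $X_i = (X_{i-1} + \xi_i)/b$ with $\xi_i$ uniform on $U(b)$ is the AR(1) representation used in the appendix.

```python
import random


def simulate_ar1(b, size, rng):
    """Simulate X_i = (X_{i-1} + xi_i) / b with xi_i uniform on {0, ..., b-1}."""
    x = rng.random()                     # X_0 uniform on [0, 1), the stationary law
    path = []
    for _ in range(size):
        x = (x + rng.randrange(b)) / b
        path.append(x)
    return path


def blocks_estimator(path, u, r):
    """Blocks with at least one exceedance of u, divided by the number of
    exceedances, over disjoint blocks of length r (assumed form of (20))."""
    m = len(path) // r
    clusters = 0
    exceedances = 0
    for j in range(m):
        count = sum(1 for x in path[j * r:(j + 1) * r] if x > u)
        exceedances += count
        clusters += 1 if count > 0 else 0
    return clusters / exceedances if exceedances else float("nan")


if __name__ == "__main__":
    rng = random.Random(12345)           # arbitrary seed, for reproducibility
    n, n_blocks, b, t = 10 ** 4, 60, 2, 0.5
    v_n, r_n = n ** -0.5, 9              # v_n = n^(-1/2), r_n = [log n]
    data = simulate_ar1(b, n_blocks * n, rng)
    estimates = [
        blocks_estimator(data[k * n:(k + 1) * n], 1.0 - v_n * t, r_n)
        for k in range(n_blocks)
    ]
    print(sum(estimates) / len(estimates))
```

Averaging the 60 per-block estimates mimics the mean curve of the experiment; an empirical confidence band could be built from the spread of the same 60 values.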
t                     0.1    0.2    0.3    0.4    0.5    0.6    0.7    0.8    0.9    1.0
$\theta_{n,t}$        0.555  0.555  0.555  0.555  0.555  0.555  0.555  0.553  0.553  0.553
$\hat\theta_{n,t}$    0.606  0.572  0.568  0.569  0.565  0.568  0.562  0.560  0.559  0.557
$CI_s$                0.638  0.593  0.585  0.585  0.579  0.581  0.573  0.571  0.569  0.566
$CI_i$                0.575  0.551  0.551  0.553  0.551  0.556  0.551  0.549  0.549  0.548

Table 1: Comparison between the blocks estimation and the theoretical approximation (31) for the AR(1)-input with $b = 2$ and $n = 10^4$.

t                     0.1    0.2    0.3    0.4    0.5    0.6    0.7    0.8    0.9    1.0
$\theta_{n,t}$        0.703  0.702  0.702  0.702  0.700  0.700  0.699  0.699  0.697  0.697
$\hat\theta_{n,t}$    0.747  0.718  0.704  0.711  0.710  0.708  0.707  0.704  0.698  0.699
$CI_s$                0.775  0.739  0.720  0.725  0.723  0.720  0.718  0.715  0.708  0.709
$CI_i$                0.718  0.698  0.689  0.697  0.697  0.695  0.696  0.694  0.688  0.689

Table 2: Comparison between the blocks estimation and the theoretical approximation (31) for the AR(1)-input with $b = 3$ and $n = 10^4$.

Appendix. $\tau$-weak dependence and proofs

A.1. Brief interlude into $\tau$-weak dependence

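Let $(\Omega, \mathcal{A}, P)$ be a probability space, and $\mathcal{M}$ a sub-$\sigma$-algebra of $\mathcal{A}$. Let $(E, \delta)$ be a Polish space endowed with its metric. For any $E$-valued random variable $X$ that is $L^p$-integrable (i.e. $X$ satisfies $\|\delta(X,0)\|_p := \left( \int \delta^p(x,0)\, P_X(dx) \right)^{1/p} < \infty$), Dedecker & Prieur [2004a] defined the coefficient $\tau_p$ as:

$$\tau_p(\mathcal{M}, X) := \left\| \sup\left\{ E[h(X) \mid \mathcal{M}] - E[h(X)] : h \in \Lambda(E,\delta) \right\} \right\|_p, \tag{32}$$

where $\Lambda(E,\delta)$ denotes the class of all Lipschitz functions $h : E \to \mathbb{R}$ such that

$$\mathrm{Lip}(h) := \sup_{x \ne y} \frac{|h(x) - h(y)|}{\delta(x,y)} \le 1.$$

Let $X = (X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$ be a triangular array of $L^p$-integrable $E$-valued random variables, and $(\mathcal{M}_i)_i$ be a sequence of sub-$\sigma$-algebras of $\mathcal{A}$.

Then, for any $n \in \mathbb{N}$, we define the coefficient:

$$\tau_{p,n}(k) := \sup_{l \ge 1} \, l^{-1} \sup\left\{ \tau_p\left(\mathcal{M}_i, (X_{n,j_1}, \dots, X_{n,j_l})\right) : i + k \le j_1 < \dots < j_l \le n \right\}, \tag{33}$$

where we consider the distance

$$d_l(x, y) := \sum_{i=1}^{l} \delta(x_i, y_i) \tag{34}$$

on $E^l$. Moreover, we say that $X$ is $\tau_p$-weakly dependent if

$$\lim_{k \to \infty} \limsup_{n \to \infty} \tau_{p,n}(k) = 0. \tag{35}$$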
Remark 3 Recall that the array $(X_{n,i})_{1 \le i \le n,\, n \in \mathbb{N}}$ is constructed from a random process $(X_i)_{i \in \mathbb{Z}}$. Therefore the dependence properties of the array are inherited from those of the process. Moreover, if the process is $\tau_p$-weakly dependent (in the usual sense defined by Dedecker & Prieur [2004a] for random processes), then the array is $\tau_p$-weakly dependent with $\tau_{p,n}(\cdot) = L_n \cdot \tau_p(\cdot)$, for some positive constant $L_n$ (which is a function of the normalization constants of the array). For instance, if we consider the normalizations in eqn. (6)-(8) we obtain that $\tau_{p,n}(\cdot) \le a_n^{-1} \cdot \tau_p(\cdot)$.

In this sense, to study the $\tau$-dependence properties of the array, it suffices to take into account the $\tau$-dependence properties of the underlying process.


We make use of the previous remark to mention the following examples of $\tau_p$-weakly dependent processes without considering the normalizations.

Example 1 (Causal Bernoulli shifts) Let $(\xi_i)_{i \in \mathbb{Z}}$ be a sequence of i.i.d.r.v's. (independent and identically distributed random variables) with values in a measurable space $D$. Assume that there exists a function $H : D^{\mathbb{N}} \to \mathbb{R}$, such that $H(\xi_0, \xi_{-1}, \dots)$ is defined almost surely. Then the stationary sequence $(X_i)_{i \ge 0}$ defined by $X_i = H(\xi_i, \xi_{i-1}, \dots)$ is called a causal Bernoulli shift.

Let $(\xi'_i)_{i \in \mathbb{Z}}$ be an independent copy of the i.i.d. sequence $(\xi_i)_{i \in \mathbb{Z}}$. Consider a decreasing sequence $(\Delta_p(i))_{i \ge 0}$ such that

$$\|X_i - X'_i\|_p \le \Delta_p(i) \tag{36}$$

for some $p \in [1, \infty]$, where $X_i = H(\xi_i, \xi_{i-1}, \dots)$ and $X'_i = H(\xi_i, \dots, \xi_1, \xi'_0, \xi'_{-1}, \dots)$. Then, if $\mathcal{M}_i = \sigma(X_j : j \le i)$, the coefficient $\tau_p(k)$ of $(X_i)_{i \in \mathbb{Z}}$ is bounded above by $\Delta_p(k)$, for all $k \in \mathbb{N}$.


Application 1 (Causal linear processes) Let $D = \mathbb{R}$ and

$$X_i = \sum_{j=0}^{\infty} b_j \xi_{i-j}. \tag{37}$$

Here we set $\Delta_p(i) = 2 \|\xi_0\|_p \sum_{j \ge i} |b_j|$ in case $\|\xi_0\|_p < \infty$.

The model (2) can be written as (37) with $b_j = b^{-j-1}$ for some integer $b \ge 2$. Since $\xi_0$ is uniformly distributed on $U(b) = \{0, \dots, b-1\}$ in this case, $X_0$ is uniformly distributed over $[0,1]$ and $\Delta_\infty(i) \le b^{-i}$.
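
The bound $\Delta_\infty(i) \le b^{-i}$ can be seen directly from the coupling of Example 1: two copies of the AR(1) recursion driven by the same innovations $\xi_1, \xi_2, \dots$ but started from independent values differ by $b^{-i} |X_0 - X'_0|$ after $i$ steps. The short Python sketch below illustrates this; the function name and the seed are hypothetical, and the recursion $X_i = (X_{i-1} + \xi_i)/b$ is the representation of model (2) assumed throughout this appendix.

```python
import random


def coupled_gaps(b, steps, rng):
    """Run two copies of X_i = (X_{i-1} + xi_i) / b that share the innovations
    xi_1, xi_2, ... but start from independent uniform values X_0 and X_0'.
    Returns the list of gaps |X_i - X_i'| for i = 1, ..., steps."""
    x, x_prime = rng.random(), rng.random()
    gaps = []
    for _ in range(steps):
        xi = rng.randrange(b)            # shared innovation, uniform on U(b)
        x = (x + xi) / b
        x_prime = (x_prime + xi) / b
        gaps.append(abs(x - x_prime))
    return gaps


if __name__ == "__main__":
    rng = random.Random(0)               # arbitrary seed
    b = 2
    for i, gap in enumerate(coupled_gaps(b, 10, rng), start=1):
        print(i, gap, b ** -i)           # the gap never exceeds b^(-i)
```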


Example 2 (Markov models) Let $G : (\mathbb{R}^l, \mathcal{B}(\mathbb{R}^l)) \times (D, \mathcal{D}) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ be a measurable function and let $(X_i)_{i \ge 1-l}$ be a sequence of random variables with values in $\mathbb{R}$ such that

$$X_i = G(X_{i-1}, X_{i-2}, \dots, X_{i-l}; \xi_i), \quad \forall i \ge 1, \tag{38}$$

for some sequence $(\xi_i)_{i \in \mathbb{N}}$ of i.i.d.r.v's. with values in a measurable space $D$ and independent of $(X_0, \dots, X_{1-l})$. Then the random variables $Y_i = (X_i, X_{i-1}, \dots, X_{i-l+1})$ define a Markov chain such that $Y_i = F(Y_{i-1}; \xi_i)$ with

$$F(x_1, \dots, x_l; \xi) := (G(x_l, \dots, x_1; \xi), x_l, x_{l-1}, \dots, x_2). \tag{39}$$

Assume that $Y_0 = (X_0, \dots, X_{1-l})$ is a stationary solution of (38). Let $Y'_0 = (X'_0, \dots, X'_{1-l})$ be independent of $(Y_0, (\xi_i)_{i \in \mathbb{N}})$ and distributed as $Y_0$. Then setting

$$X'_i = G(X'_{i-1}, \dots, X'_{i-l}; \xi_i), \tag{40}$$

$X'_i$ is distributed as $X_i$ and it is independent of $\mathcal{M}_0 = \sigma(X_0, \dots, X_{1-l})$, for all $i \in \mathbb{N}$. As in the previous example, let $(\Delta_p(i))_{i \ge 0}$ be a non-increasing sequence such that (36) holds, with $X_i$ and $X'_i$ defined in (38) and (40), respectively. Hence one can apply the result of Lemma 3 in [Dedecker & Prieur, 2004a], and we obtain that $\tau_p(k) \le \Delta_p(k)$.

In particular, if $G$ is such that

$$\|G(x; \xi_1) - G(y; \xi_1)\|_p \le \sum_{i=1}^{l} a_i |x_i - y_i| \quad \text{with} \quad \sum_{i=1}^{l} a_i < 1, \tag{41}$$

then $\Delta_p(k) \le C a^k$ for some $a \in [0,1)$ and some $C > 0$ (see [Dedecker et al. 2007], page 34).


Application 2 (Contracting Markov chain) Let $X_i = G(X_{i-1}, \xi_i)$ be a Markov chain such that $G : (\mathbb{R}, \mathcal{B}(\mathbb{R})) \times (D, \mathcal{D}) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ is a measurable function and

$$A = \|G(0; \xi_1)\|_p < \infty \quad \text{and} \quad \|G(x; \xi_1) - G(y; \xi_1)\|_p \le a |x - y|, \tag{42}$$

for some $a \in (0,1)$ and some $p \in [1, \infty]$. Then $(X_i)_{i \in \mathbb{N}}$ has a stationary solution with finite $p$-th order moment, as proved on page 35 of [Dedecker et al. 2007]. Moreover, under this condition: $\Delta_p(k) = \|X_0 - X'_0\|_p \cdot a^k$.


Remark 4 In particular, if $G(u; \xi) = A(u) + B(u)\xi$ for suitable Lipschitz functions $A(u)$ and $B(u)$ with $u \in \mathbb{R}$, then the corresponding iterative model (ARCH-type process) $X_i = G(X_{i-1}; \xi_i)$ satisfies (42) with $a = \mathrm{Lip}(A) + \|\xi_1\|_p \mathrm{Lip}(B) < 1$.
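
To make the contraction in Remark 4 concrete, here is a small Python sketch with a hypothetical ARCH-type map: $A(u) = 0.3u$, $B(u) = 0.2\sqrt{1+u^2}$ and innovations uniform on $[-1,1]$, so that $\|\xi_1\|_\infty = 1$ and $a = 0.3 + 0.2 = 0.5$. These choices are ours and serve only as an illustration of the case $p = \infty$, where the bound $|X_k - X'_k| \le a^k |X_0 - X'_0|$ holds along every path of two chains driven by the same innovations.

```python
import math
import random

LIP_A, LIP_B = 0.3, 0.2                  # hypothetical Lipschitz constants


def G(u, xi):
    """Illustrative ARCH-type map G(u; xi) = A(u) + B(u) * xi."""
    return LIP_A * u + LIP_B * math.sqrt(1.0 + u * u) * xi


def coupling_gaps(x0, x0_prime, steps, rng):
    """Iterate two chains with shared innovations and record |X_k - X_k'|."""
    x, y = x0, x0_prime
    gaps = []
    for _ in range(steps):
        xi = rng.uniform(-1.0, 1.0)      # shared innovation with |xi| <= 1
        x, y = G(x, xi), G(y, xi)
        gaps.append(abs(x - y))
    return gaps


if __name__ == "__main__":
    a = LIP_A + 1.0 * LIP_B              # Lip(A) + ||xi||_inf * Lip(B) = 0.5
    gaps = coupling_gaps(0.2, 1.7, 10, random.Random(1))
    for k, gap in enumerate(gaps, start=1):
        print(k, gap, abs(0.2 - 1.7) * a ** k)
```

For finite $p$ the same computation holds in $L^p$ norm rather than pathwise, which is the form stated in (42).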


Remark 5 The stationary iterative models $X_i = G(X_{i-1}, \xi_i)$ are causal Bernoulli shifts if the condition (42) holds; this is proved in Proposition 3.2 in [Dedecker et al. 2007].


Application 3 (Nonlinear AR($l$)-models) Let $l \ge 1$ and $(X_i)_i$ be the stationary solution of some equation

$$X_i = R(X_{i-1}, \dots, X_{i-l}) + \xi_i$$

for some measurable function $R : \mathbb{R}^l \to \mathbb{R}$. The process $(X_i)_i$ is then called a stationary real nonlinear autoregressive model of order $l$. If $\|\xi_1\|_p < \infty$ and

$$|R(u_1, \dots, u_l) - R(v_1, \dots, v_l)| \le \sum_{i=1}^{l} a_i |u_i - v_i|, \quad \text{for } a_1, \dots, a_l \ge 0 \text{ with } \sum_{i=1}^{l} a_i < 1,$$

and for all $(u_1, \dots, u_l), (v_1, \dots, v_l) \in \mathbb{R}^l$, then the function $G : \mathbb{R}^{l+1} \to \mathbb{R}$ defined by $G(u; \xi) = R(u) + \xi$ satisfies Condition (41) and therefore the sequence $(\Delta_p(k))_k$ admits an exponential decay rate.
A.2. Coupling

Let $d_\cup : E_\cup \times E_\cup \to [0, \infty)$ be a pseudometric defined as follows: let $x_k = (x_k^{(1)}, \dots, x_k^{(r_k)}) \in E_\cup$ for $k = 1, 2$ and denote

$$x_{1,2} := \begin{cases} x_1, & \text{if } r_1 \ge r_2, \\ (x_1^{(1)}, \dots, x_1^{(r_1)}, x_2^{(r_1+1)}, \dots, x_2^{(r_2)}), & \text{if } r_1 < r_2. \end{cases}$$

The vector $x_{2,1}$ is defined similarly. Then,

$$d_\cup(x_1, x_2) := d_{r_1 \vee r_2}(x_{1,2}, x_{2,1}), \tag{43}$$

where $d_r(\cdot, \cdot)$ is defined in (34).


Lemma 2 (Coupling: the even and odd blocks) Suppose that the random variables $(X_{n,i})_{1 \le i \le n}$ are such that (B.2) holds. We treat the even-indexed and odd-indexed blocks together, using $k = 0$ or $k = 1$ according to the parity. Assume that for each $j \in \{2, \dots, [m_n/2]\}$ there is a random variable $U_{k,j}$ uniformly distributed on $[0,1]$ and independent of $\mathcal{M}^k_{n,j-1} = \sigma(Y_{n,2-k}, \dots, Y_{n,2(j-1)-k})$ and $\sigma(Y_{n,2j-k})$. Then there exists a random block $Y^*_{n,2j-k}$ measurable with respect to $\mathcal{M}^k_{n,j-1} \vee \sigma(Y_{n,2j-k}) \vee \sigma(U_{k,j})$, independent of $\mathcal{M}^k_{n,j-1}$ and distributed as $Y_{n,2j-k}$ such that

$$\left\| E\left( d_\cup(Y_{n,2j-k}, Y^*_{n,2j-k}) \mid \mathcal{M}^k_{n,j-1} \right) \right\|_1 \le r_n \tau_{1,n}(r_n). \tag{44}$$

In particular, if we set $\mathcal{M}^k_{n,j-1} = \sigma(Y^*_{n,2-k}, \dots, Y^*_{n,2(j-1)-k})$, then the blocks $(Y^*_{n,2j-k})_{1 \le j \le [m_n/2]}$ are independent.


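Proof: We set here $k = 0$ (for the even-indexed blocks) since the steps are similar if $k = 1$.

Let $Y_{n,2j} = (X_{n,i})_{i \in A_j}$ be a random block, where $A_j = \{(2j-1) r_n + 1, \dots, 2j r_n\}$. Then, from Dedecker et al. [2007]'s Lemma 5.3, there exists a random block $Y^*_{n,2j} \in E^{r_n}$ measurable with respect to $\mathcal{M}^0_{n,j-1} \vee \sigma(Y_{n,2j}) \vee \sigma(U_{0,j})$, independent of $\mathcal{M}^0_{n,j-1}$ and distributed as $Y_{n,2j}$ such that

$$\tau_1(\mathcal{M}^0_{n,j-1}, Y_{n,2j}) = \left\| E\left( d_{r_n}(Y_{n,2j}, Y^*_{n,2j}) \mid \mathcal{M}^0_{n,j-1} \right) \right\|_1. \tag{45}$$

Therefore,

$$\left\| E\left( d_\cup(Y_{n,2j}, Y^*_{n,2j}) \mid \mathcal{M}^0_{n,j-1} \right) \right\|_1 = \left\| E\left( d_{r_n}(Y_{n,2j}, Y^*_{n,2j}) \mid \mathcal{M}^0_{n,j-1} \right) \right\|_1 = \tau_1(\mathcal{M}^0_{n,j-1}, Y_{n,2j}) \le r_n \tau_{1,n}(r_n). \quad \Box \tag{46}$$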


Lemma 3 (Coupling: the sub-blocks) Suppose that the random variables $(X_{n,i})_{1 \le i \le n}$ are such that (B.2) holds. Moreover, assume that for each $j \in \{2, \dots, m_n\}$ there is a random variable $U_j$ uniformly distributed on $[0,1]$ and independent of the $\sigma$-algebras $\mathcal{M}_{n,j-1} = \sigma(Y^{(r_n - l_n)}_{n,1}, \dots, Y^{(r_n - l_n)}_{n,j-1})$ and $\sigma(Y^{(r_n - l_n)}_{n,j})$. Then there exists a random block $Y^{*(r_n - l_n)}_{n,j}$, measurable with respect to $\mathcal{M}_{n,j-1} \vee \sigma(Y^{(r_n - l_n)}_{n,j}) \vee \sigma(U_j)$, independent of $\mathcal{M}_{n,j-1}$ and distributed as $Y^{(r_n - l_n)}_{n,j}$ such that

$$\left\| E\left( d_\cup(Y^{(r_n - l_n)}_{n,j}, Y^{*(r_n - l_n)}_{n,j}) \mid \mathcal{M}_{n,j-1} \right) \right\|_1 \le r_n \tau_{1,n}(l_n). \tag{47}$$

Moreover, if we set $\mathcal{M}_{n,j-1} = \sigma(Y^{*(r_n - l_n)}_{n,1}, \dots, Y^{*(r_n - l_n)}_{n,j-1})$, then the blocks $(Y^{*(r_n - l_n)}_{n,j})_{1 \le j \le m_n}$ are independent.

Proof. The argument is the same as in the previous proof. Note, however, that the sub-blocks $(Y^{(r_n - l_n)}_{n,j})_j$ are separated by $l_n$ variables. $\Box$


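A.3. Proof of Theorem 1

Let $(Y_{n,j})_{1 \le j \le m_n}$ be the blocks built from $(X_{n,i})_{1 \le i \le n}$. For $k \in \{0,1\}$, we consider the independent blocks $(Y^*_{n,2j-k})_{1 \le j \le [m_n/2]}$ coupled to the original blocks $(Y_{n,2j-k})_{1 \le j \le [m_n/2]}$ from Lemma 2. Therefore, if we define $\Delta^*_{n,j}(f) := f(Y^*_{n,j}) - f(Y^{*(r_n - l_n)}_{n,j})$ for $j = 1, \dots, m_n$, we have that $\Delta^*_{n,j}(f) \overset{d}{=} \Delta_{n,j}(f) \overset{d}{=} \Delta_n(f)$ for each $j$, where $\Delta_{n,j}(f) := f(Y_{n,j}) - f(Y^{(r_n - l_n)}_{n,j})$ and $\Delta_n(f)$ is defined in (11). Now, if we consider the assumption (C.1), we can apply Petrov [1975]'s Theorem 1 (Section IX.1) to the i.i.d.r.v's $X_{n,j} := (n v_n)^{-1/2} \Delta^*_{n,j}(f)$, so

$$DZ^{*(k)}_n(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{[m_n/2]} \left( \Delta^*_{n,2j-k}(f) - E \Delta^*_{n,2j-k}(f) \right) = o_P(1) \tag{48}$$

for $k = 0, 1$. In consequence,

$$DZ_n(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{m_n} \left( \Delta_{n,j}(f) - E \Delta_{n,j}(f) \right) = o_P(1). \tag{49}$$

On the other hand, by Lemma 3, we have that

$$BZ^*_n(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{m_n} \left( f(Y^{*(r_n - l_n)}_{n,j}) - E f(Y^{*(r_n - l_n)}_{n,j}) \right) \tag{50}$$

converges weakly in fidis if, and only if,

$$BZ_n(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{m_n} \left( f(Y^{(r_n - l_n)}_{n,j}) - E f(Y^{(r_n - l_n)}_{n,j}) \right) \tag{51}$$

converges weakly in fidis (and in this case the limit distributions are the same). The latter holds because $BZ_n(f) = Z_n(f) - DZ_n(f)$ and from the assumptions (C.2) and (C.3), where

$$Z_n(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{m_n} \left( f(Y_{n,j}) - E f(Y_{n,j}) \right).$$

Finally, as $Z_n(f) = BZ_n(f) + DZ_n(f)$ for all $f \in \mathcal{F}$, we get the result.

$\Box$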
A.4. Proof of Proposition 1

It suffices to prove the following multidimensional version of Segers [2003]'s condition (6):

$$\lim_{l \to \infty} \limsup_{n \to \infty} P\left\{ Y_n^{(l+1:r_n)} \ne 0 \mid X_{n,1} \ne 0 \right\} = 0, \tag{52}$$

since the rest of the proof follows the same steps as the proof of Drees & Rootzén [2010]'s Lemma 5.2.

Indeed, let $h(\cdot) = \mathbf{1}\{\cdot \ne 0\}$ be a function defined on $E^{r_n - l}$. Consider an increasing sequence of functions $h_k(\cdot) : E^{r_n - l} \to [0,1]$ which approximate $h$, and such that
Lip(%) = for some a > 0. Of course, we set k = k{n) <Y n. Then, 

limsupP{Y^ ,+lr '^ f 0|X (!/ i f 0} = limsupu~ 1 P{Y^ ,+lr "' ) f 0,X„i f 0} 


= lim sup v n 

n— >oo 

= lim sup v n 1 

n—¥ oo 

= limsup^ 


0 } 

[ 

{x„,iY0} 


P{Y,[ m:r ") ^ 0\a{X nrl )}dF 

E[l{Y 1 ? +1:r " ) /0}|<7-(X n/1 )]dP 


E 


CO V k 0} 


h k (Y% 




k(x„,i 


dP 


Y lim sup 


k —,>oo 

Y lim sup 

k —>oo 


0L + l/p 


E 


ftlk(rf +1:r " ] ) 


^(X„,i) 


( PYr77V P (0 + (n - 


L V k 


,. TnT n ,p(l) 

Y limsup g+1/p ■ 


fc—>oo U, 


Finally, taking $l \to \infty$, the limit (52) is proven.


□ 


A.5. Proof of Theorem 2

Note that $Z_n$ is asymptotically tight iff $Z_n^{(k)}$ defined by:

$$Z_n^{(k)}(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{[m_n/2]} \left( f(Y_{n,2j-k}) - E f(Y_{n,2j-k}) \right) \tag{53}$$

is asymptotically tight for each $k \in \{0,1\}$. On the other hand, for each $k \in \{0,1\}$ we use Lemma 2 together with conditions (B.2) and (D.1) to build independent blocks $(Y^*_{n,2j-k})_{1 \le j \le [m_n/2]}$ coupled to the original blocks $(Y_{n,2j-k})_{1 \le j \le [m_n/2]}$. In this manner we have that $Z_n^{(k)}$ is asymptotically tight iff

$$Z_n^{*(k)}(f) := \frac{1}{\sqrt{n v_n}} \sum_{j=1}^{[m_n/2]} \left( f(Y^*_{n,2j-k}) - E f(Y^*_{n,2j-k}) \right) \tag{54}$$

is asymptotically tight, for each $k \in \{0,1\}$. The latter is true due to Theorem 2.11.9 in [Van Der Vaart & Wellner 1996] by setting $Z_{nj}(f) = f(Y^*_{n,j})$ and $[m_n/2]$ instead of $m_n$. For the remaining assertion we use Theorem 1. $\Box$


A.6. Proof of Theorem 3

Consider (T.5). Note that, by the triangle inequality, $Z_n$ is asymptotically equicontinuous if $Z_n^{(k)}$ from eqn. (53) is asymptotically equicontinuous for each $k \in \{0,1\}$. Now, we again use Lemma 2 together with conditions (B.2) and (D.1), as in the previous proof, to prove that $Z_n^{(k)}$ is asymptotically equicontinuous iff $Z_n^{*(k)}$ is asymptotically equicontinuous for each $k \in \{0,1\}$. However, in this case $Z_n^{*(k)}$ is asymptotically equicontinuous by Theorem 2.11.1 in [Van Der Vaart & Wellner 1996].

The remaining steps are the same as in the proof of Theorem 2.10 in [Drees & Rootzén, 2010]. $\Box$


A.7. Proof of Proposition 2

The steps are the same as in the proof of Theorem 2.1 in [Drees, 2011], but replacing the assumptions (C1) and (C2) of his paper by our assumptions (B.2) and (B.1), respectively. $\Box$


A.8. Proof of Corollary 3

It suffices to replace the assumptions (C1) and (C2) in the proof of Drees [2011]'s Corollary 2.3 by our assumptions (B.2) and (B.1), respectively. $\Box$


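A.9. Proof of the expression (26)

If $(X_i)_{i \ge 0}$ is the AR(1)-process (2), note that for each $i \in \mathbb{N}$

$$X_i = \frac{X_0}{b^i} + \sum_{s=1}^{i} \frac{\xi_s}{b^{i-s+1}}. \tag{55}$$

Therefore, if $n$ is sufficiently large such that $b^d v_n < 1$, then for $x \in [0,1]^d$:

$$\begin{aligned}
P\{X_{n,1} > x, X_{n,1} \ne 0\}
&= P\{X_i > a_n x_i + u_n, \text{ for some } i = 1, \dots, d\} \\
&= P\left\{ X_0 > b^i (a_n x_i + u_n) - \sum_{s=1}^{i} b^{s-1} \xi_s, \text{ for some } i = 1, \dots, d \right\} \\
&= P\left\{ X_0 > \min_{i=1,\dots,d} \left\{ b^i (a_n x_i + u_n) - \sum_{s=1}^{i} b^{s-1} \xi_s \right\} \right\} \\
&= \sum_{j_1,\dots,j_d \in U(b)} P\left\{ X_0 > \min_{i=1,\dots,d} \left\{ b^i (a_n x_i + u_n) - \sum_{s=1}^{i} b^{s-1} j_s \right\}, (\xi_1, \dots, \xi_d) = (j_1, \dots, j_d) \right\} \\
&= \frac{1}{b^d} \sum_{j_1,\dots,j_d \in U(b)} P\left\{ X_0 > \min_{i=1,\dots,d} \left\{ b^i (a_n x_i + u_n) - \sum_{s=1}^{i} b^{s-1} j_s \right\} \right\} \\
&= \frac{1}{b^d} \sum_{j_1,\dots,j_d \in U(b)} \left( \max_{i=1,\dots,d} \left\{ 1 - b^i + \sum_{s=1}^{i} b^{s-1} j_s + b^i v_n (1 - x_i) \right\} \wedge 1 \right),
\end{aligned}$$

since $\gamma_b(j_1, \dots, j_d; i) := 1 - b^i + \sum_{s=1}^{i} b^{s-1} j_s \le -1$ for all $(j_1, \dots, j_d) \in U^d(b) \setminus \{(b-1, \dots, b-1)\}$ and $\gamma_b(b-1, b-1, \dots, b-1; i) = 0$. Thus,

$$P\{X_{n,1} > x \mid X_{n,1} \ne 0\} \xrightarrow[n \to \infty]{} \max_{i=1,\dots,d} \left\{ b^{i-d} (1 - x_i) \right\}. \tag{56}$$

$\Box$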
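
The value $\gamma_b(b-1, \dots, b-1; i) = 0$ used in the proof is a geometric sum. Spelled out as a routine check (added here for convenience, using only the definition of $\gamma_b$ and the relation $v_n = P\{X_{n,1} \ne 0\}$ from the text):

```latex
\gamma_b(b-1,\dots,b-1;i)
  = 1 - b^i + (b-1)\sum_{s=1}^{i} b^{s-1}
  = 1 - b^i + (b-1)\,\frac{b^i - 1}{b-1}
  = 0 .
% Hence the tuple (b-1,...,b-1) contributes
%   b^{-d} \max_{i} \{ b^i v_n (1 - x_i) \}   (which is < 1 since b^d v_n < 1),
% and dividing by v_n = P\{X_{n,1} \neq 0\} gives
\frac{1}{v_n}\cdot\frac{1}{b^d}\max_{i=1,\dots,d}\bigl\{ b^i v_n (1-x_i) \bigr\}
  = \max_{i=1,\dots,d}\bigl\{ b^{\,i-d}(1-x_i) \bigr\},
% which is the limit in (56).
```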


A.10. Proof of the expression (28)


Let $x, y \in [0,1]^d$. Then, as before, for $i \ge 1$, if $n$ is sufficiently large such that $b^{i+d} v_n < 1$, we have:

P{ X I!,1 A X, X,; > X /} 

= P {X /c > a n x k + u n , X i+ i > a n \ji + u n , for some ( k,j ) E {1,..., d} 2 } 


= P jx 0 > k r ff n d + u n ) - | , 

Xj > ; min^ lb l (a n yi + u n ) - j 

Yf P<X 0 > min i b k (a„x k + u n ) - £& s_1 / s l, 

h....,j d eu(b) { k-X,...,d s=l J 

ji+l'~'ji+d€U(b) 

X { > min I b l (a n yi + u n ) - Vb^js+i \ , 

1=1 . d [ S=1 J 

((,1/ • • • / • • • / (/1/ • • • / ji+\r • • • / ji+d) } 
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= p E P ^o> mm 


h,-,id eU ( b ) 

ji+l'--’ii+d£U{b) 

X; > min 


nin i b k {a n x k + u n ) -Yb s 1 j s \ , 

1 .I S=1 I 


I ; I 

min b^anX/i + iin) - Y^js+i = ( ji,---,jd ) 

;=1 d I s=l I 


= p E p^ 0 >i- 


h'-'ici eU W 

ji + X,—,ji + 4€U(b) 


max 

k=l,...,d 


ji,... ,jd,k)+ b k v n { l-xjc)} Al, 


X; > 1 - 


; max rf {Mii+i,...,j i+d ) + ^«(1 - yi)} + A h (£i,. • .,£d) = (ji, ■ ■ • //<*)} 

= Ep jx 0 > 1 -^max^ -**)}, 

X/ > 1 - ? max^ -y/)},£i = ... = U = ft - 1 j 

since }L h (j lr ... ,j d ; i) := 1 - fe f + H=i ^“Vs < -1 for all G lt rf (fr)\{(b- 

1 ,... ,£> — 1)} and — 1, — 1,..., — 1) = 0. 

Moreover, note that if i > d 

P{X, U > x,X nii+ 1 > y} = Ep |x 0 > 1 - ^max^ {Vi7„(l - x k )} , 

Xj >1 ; max rf <jVu„(l -y/)},?i = ... = = 6- l| 

= Ep jx 0 > 1 - fc max^ <{Vu„(l -x fc )}, 

Xo > b‘ — b l max | b l v n (l — yi)X + 1 — b d — Y k s_1 £s 1 


E ^ P<jx 0 >l- max |— x fc ) |, 




jd+ i,...,jjSU(b) 


k =1. d 


Xq> b l — b‘ max b 


1=1,...,d 


|b , U„(l-y / )}+l-b lf - ^ Vs/(&+!/• ••/£«) = (jd+i,. ..,ji) 


s — d.-\-\ 


U . — 

id+i,...,ji^U(b) 


E F {x 0 >l - max \b k v n ( 1 - x k )\ , 

■ 6 U{b) f k-l,...,d l J 

X 0 >b l -b' max \b l v n (l - yi)\ + 1 - b d - Y ^ Vs 
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= 1 E -**)}, 

]d+i,...,j i eU(b) \ 


max 

1=1,...A 


b d + Yj & S_1 ^ + &' +i 0n(l-yz)-& , f /I 
s—d+1 J a- > 


0 J! - 

= —r mm 

b' 




max 
1=1,...,d 


{&' +i (l-yz)}) 


Similarly for 1 ^ i < d, we obtain that 


v n H b/i (x,y) 


P{X, U > x, X n > j/} 

= ^4 min ^ max {b k (l — x b )}, max {t^min(l — x b ,l — yf)}, max {b k+l (l — yf)} 

= v n H bii (x,y) 


From Femma 5.2 - (iii) in 
sufficiently large: 


[Drees & Rootzen, 2010], E|/(Y„)| 


Thus, for n 


Cov (f x (Y n ),f y (Y n )) 


r n Vn 
r, i-l 


P{X M/ 1 > x, X n l >y} 


i =l 


mm max 


+ 1 ) (P{X,a > x,X n4+ i > y} + P{x, a > y, X„, i+1 > x}) 




{&*(! - **)}/ max {b fc (l - yjt)} + ( H bri (x,y ) + H b/i (y,x)). 


k=1,... 


i=1 


□ 


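A.11. Proof of the expression (31)

The proof is similar to the proof of the expression (26). Indeed,

$$\begin{aligned}
P\left\{ \max_{1 \le i \le r_n} X_i > 1 - v_n t \right\}
&= P\{X_i > 1 - v_n t, \text{ for some } i = 1, \dots, r_n\} \\
&= P\left\{ X_0 > b^i (1 - v_n t) - \sum_{s=1}^{i} b^{s-1} \xi_s, \text{ for some } i = 1, \dots, r_n \right\} \\
&= P\left\{ X_0 > \min_{1 \le i \le r_n} \left\{ b^i (1 - v_n t) - \sum_{s=1}^{i} b^{s-1} \xi_s \right\} \right\} \\
&= \sum_{j_1,\dots,j_{r_n} \in U(b)} P\left\{ X_0 > \min_{1 \le i \le r_n} \left\{ b^i (1 - v_n t) - \sum_{s=1}^{i} b^{s-1} j_s \right\}, (\xi_1, \dots, \xi_{r_n}) = (j_1, \dots, j_{r_n}) \right\} \\
&= \frac{1}{b^{r_n}} \sum_{j_1,\dots,j_{r_n} \in U(b)} P\left\{ X_0 > \min_{1 \le i \le r_n} \left\{ b^i (1 - v_n t) - \sum_{s=1}^{i} b^{s-1} j_s \right\} \right\} \\
&= \frac{1}{b^{r_n}} \sum_{j_1,\dots,j_{r_n} \in U(b)} \min\left\{ \max_{1 \le i \le r_n} \left\{ 1 + \sum_{s=1}^{i} b^{s-1} j_s - b^i (1 - v_n t) \right\}, 1 \right\}.
\end{aligned}$$

$\Box$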